Kaiyuan Liu

MCGA: A Multi-task Classical Chinese Literary Genre Audio Corpus

Jan 14, 2026

Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

Jan 14, 2026

Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation

Dec 24, 2025

FrontierCS: Evolving Challenges for Evolving Intelligence

Dec 17, 2025

CCFQA: A Benchmark for Cross-Lingual and Cross-Modal Speech and Text Factuality Evaluation

Aug 10, 2025

Efficient Reasoning Through Suppression of Self-Affirmation Reflections in Large Reasoning Models

Jun 14, 2025

LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?

Jun 13, 2025

GeoCAD: Local Geometry-Controllable CAD Generation

Jun 12, 2025

ProjectEval: A Benchmark for Programming Agents Automated Evaluation on Project-Level Code Generation

Mar 10, 2025

Moyun: A Diffusion-Based Model for Style-Specific Chinese Calligraphy Generation

Oct 10, 2024